A Method of Incorporating Bigram Constraints into an LR Table and Its Effectiveness in Natural Language Processing

Authors

  • Hiroki Imai
  • Hozumi Tanaka
Abstract

In this paper, we propose a method for constructing bigram LR tables by incorporating bigram constraints into an LR table. Using a bigram LR table, a GLR parser can make use of both bigram and CFG constraints in natural language processing. Applying bigram LR tables to our GLR method has the following advantages: (1) Language models utilizing bigram LR tables have lower perplexity than simple bigram language models, since local constraints (bigram) and global constraints (CFG) are combined in a single bigram LR table. (2) Bigram constraints are easily acquired from a given corpus. Therefore, data sparseness is not likely to arise. (3) Separation of local and global constraints keeps down the number of CFG rules. The first advantage leads to a reduction in complexity and, as a result, better performance in GLR parsing. Our experiments demonstrate the effectiveness of our method.
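As a rough illustration of the idea in the abstract, the sketch below prunes an LR action table with bigram constraints. It is a simplified toy, not the authors' construction, which the abstract does not spell out. All states, terminals, actions, and bigrams are hypothetical, and the function name `prune_with_bigrams` is invented for this example. It relies on one standard LR property: a state entered by shifting terminal `a` is on top of the stack only immediately after `a` was read, so any lookahead `b` in that state must form an allowed bigram `(a, b)`.

```python
# Hypothetical toy data: none of these states, symbols, or bigrams come from the paper.
# ACTION maps (state, lookahead terminal) -> list of actions (GLR tables may hold several).
ACTION = {
    (0, "det"): ["s1"],
    (0, "noun"): ["s2"],
    (1, "noun"): ["s2"],
    (1, "det"): ["s1"],   # kept in this toy table, but "det det" is not an allowed bigram
    (2, "verb"): ["s3"],
    (2, "$"): ["r1"],
    (3, "$"): ["r2"],
}

# Terminal whose shift enters each state (assumed known from table construction).
ACCESSING_TERMINAL = {1: "det", 2: "noun", 3: "verb"}

# Allowed adjacent terminal pairs, e.g. collected from a corpus.
ALLOWED_BIGRAMS = {("det", "noun"), ("noun", "verb"), ("noun", "$"), ("verb", "$")}


def prune_with_bigrams(action, accessing, allowed):
    """Drop entries whose (previous terminal, lookahead) pair is not an allowed bigram."""
    pruned = {}
    for (state, lookahead), acts in action.items():
        prev = accessing.get(state)
        # States not entered via a terminal (e.g. the start state) are left untouched.
        if prev is not None and (prev, lookahead) not in allowed:
            continue
        pruned[(state, lookahead)] = list(acts)
    return pruned


BIGRAM_ACTION = prune_with_bigrams(ACTION, ACCESSING_TERMINAL, ALLOWED_BIGRAMS)
```

In this toy table only the entry for state 1 with lookahead `det` is removed, so a parser driven by `BIGRAM_ACTION` never explores that branch. States entered via a nonterminal are deliberately left alone here; handling them would need information about which terminals can end the reduced phrase, which this sketch does not model.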

Similar resources

Incorporating LR Parsing into SPHINX

This paper describes the integration of an LR natural language parser with the SPHINX speech recognition system. SPHINX is one of the most successful speech recognition systems in use today. Although it attains high word accuracy, SPHINX often outputs ungrammatical recognition results because the baseline SPHINX system uses very simple word-pair or bigram language models. For applications of sp...

Using Generalized Language Model for Question Matching

Question answering is one of the popular services on the World Wide Web. The main goal of these services is to find the best answer to a user's input question as quickly as possible. To achieve this aim, most of them use new techniques for question matching. We have a lot of question and answering services in Persian web, so it seems that developing a question matching m...

Incorporating E-learning in teaching English language to medical students: exploring its potential contributions

Background: The spread of technology has influenced different aspects of human life, and teaching and learning are not exceptions. This study aimed to examine the potential contribution of the use of technology in teaching English language to medical students. Methods: This qualitative-action research study was conducted in Birjand University of Medical Sciences (BUMS), with 60 medica...

LR Parsers For Natural Languages

MLR, an extended LR parser, is introduced, and its application to natural language parsing is discussed. An LR parser is a shift-reduce parser which is deterministically guided by a parsing table. A parsing table can be obtained automatically from a context-free phrase structure grammar. LR parsers cannot manage ambiguous grammars such as natural language grammars, because their parsing tab...

Publication date: 1986